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Background Intrauterine growth restriction is a cause of neonatal 
morbidity and mortality. A variety of definitions of low 
birthweight are used in clinical practice, with a lack of consensus 
regarding which definitions best predict adverse outcomes. 

Objectives To evaluate the relationship between birthweight 
standards and neonatal outcome in term-born infants (at > 37 weeks 
of gestation) . 

Search strategy MEDLINE (1966-January 2011), EMBASE (1980- 
January 2011), and the Cochrane Library (2011:1) and MEDION 
were included in our search. 

Selection criteria Studies comprising live term-born infants 
(gestation > 37 completed weeks), with weight or other 
anthropometric measurements recorded at birth along with 
neonatal outcomes. 

Data collection and analysis Data were extracted to populate 
2x2 tables relating birthweight standard with outcome, and 
meta-analysis was performed where possible. 



Main results Twenty- nine studies including 21 034 114 neonates 
were selected. Absolute birthweight was strongly associated with 
mortality, with birthweight < 1.5 kg giving the largest association 
(OR 48.6, 95% CI 28.62-82.53). When using centile charts, 
regardless of threshold, the summary odds ratios were significant 
but closer to 1 than when using absolute birthweight. For all tests, 
summary predictive ability comprised high specificity and positive 
likelihood ratio for neonatal death, but low sensitivity and a 
negative likelihood ratio close to 1. 

Author's conclusions Absolute birthweight is a prognostic factor 
for neonatal mortality. The indirect evidence suggests that centile 
charts or other definitions of low birthweight are not as strongly 
associated with mortality as the absolute birthweight. Further 
research is required to improve predictive accuracy. 

Keywords Fetal grovrth restriction, low birthweight, neonatal 
morbidity, neonatal mortality, systematic review. 
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Introduction 

Intrauterine growth restriction remains a significant prob- 
lem in current obstetric and neonatal practice, and is a sig- 
nificant cause of perinatal mortality and morbidity. ^'^ 
Statistically 'normal' birthweight is defined as being within 
a range around the central tendency (e.g. centile ranges). 
This simple approach has many deficiencies. Clinically, 



infants who are of low birthweight may belong to one of 
four groups. There are those that suffer intrauterine growth 
restriction, whereby the fetus does not achieve their growth 
potential because of environmental factors, such as placen- 
tal insufficiency or maternal health status. Others may 
have a structural or chromosomal abnormality that affects 
their growth.^ Another group of infants who have low 
birthweight are those that are constitutionally small. These 
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babies reach their growth potential; they are not subject to 
a pathological process.'' Low birthweight also refers to 
babies who are normally grown but are born prematurely. 
Prematurity is independently associated with increased 
mortality and long-term morbidity.^'*' 

A number of methods have been used to attempt to 
identify infants who are most at risk of adverse outcomes, 
including neonatal morbidity and mortality. These include: 
population-based centile charts, with the most commonly 
used threshold being the tenth centile^; customised charts, 
where the mother's body mass index (BMI) and ethnicity 
are used to calculate individualised growth centiles**; and 
ponderal index, which takes into account neonatal weight 
and length.' The published associations between each stan- 
dard for defining growth restriction and adverse outcome 
vary, and there is no current consensus regarding the best 
method.^" Within current UK practice a variety of different 
population and customised centile charts are used antena- 
tally, with a different growth chart used for the postnatal 
period, and with the absolute birthweight (<2.5 kg) often 
being used to determine the need for increased care or 
observation in the neonatal period. 

The aim of this systematic review was to re-examine the 
association between measures of low birthweight, including 
absolute birthweight and other anthropometric measure- 
ments, such as ponderal index, with adverse neonatal out- 
comes. We attempted to avoid the confounding influence 
of prematurity and to determine which definition of 
growth restriction has the strongest prognostic association 
with, and is the best predictor for, subsequent morbidity 
and mortality. 

In this article, the term 'prognostic' refers to the strength 
of association between a birthweight test and the odds of 
an adverse outcome, as measured by an odds ratio. The 
term 'predictive' refers to the abUity of a test to discrimi- 
nate between babies who will and babies who will not 
experience an adverse outcome, as measured by sensitivity, 
specificity, and positive and negative likelihood ratios. 
A test may have strong prognostic ability, but not necessar- 
ily good predictive ability, and so it is important to 
consider both." 

Methods 

A protocol-driven systematic review was performed using 
widely recommended methods for reviews (Appendix Sl),'^ 
and is reported according to the MOOSE (meta-analysis of 
observational studies in epidemiology) guidance. This 
study was performed as part of a larger systematic review 
to determine the association of birthweight standards with 
outcomes throughout life, and therefore the search strate- 
gies and study selection process refer to the studies 
included for the overall project. The articles relating to 



outcomes in the neonatal period are reported in this arti- 
cle, whereas those relating to chUdhood and adult out- 
comes are reported separately. 

Sources 

We searched MEDLINE (1966-January 2011), EMBASE 
(1980-January 2011), the Cochrane Library (2011:1), and 
MEDION for relevant published articles. In order to iden- 
tify 'grey' literature, OpenGrey and Web of Science were 
also searched for relevant citations. In MEDLINE the 
search consisted of a combination of medical subject head- 
ings (MeSHs; e.g. infant, small for gestational age, fetal 
growth retardation), keywords (e.g. intrauterine growth 
retardation, low birthweight), and word variants using the 
Boolean operator 'OR' for capturing citations of the rele- 
vant text. These were combined using 'AND' with a combi- 
nation of MeSHs (e.g. human development, infant 
mortality, diabetes mellitus), keywords (e.g. developmental 
delay, handicap, cardiovascular disease), and word variants 
to capture relevant outcomes. The search was restricted to 
human studies, but no language restrictions were applied. 
The MEDLINE search strategy (Appendix S2) was adapted 
for use in other electronic databases. Hand searching of 
recent major journals was also performed. The search was 
performed by two investigators: R.K.M. and G.M. A com- 
prehensive database collating all citations was constructed 

using REFERENCE MANAGER 12.0. 

Study selection and data extraction 

Initially, the database was scrutinised by two reviewers 
(R.K.M. or G.M., partly in duplicate), and full articles of 
all citations that were likely to meet the predefined selec- 
tion criteria were obtained. Articles in languages other than 
English were translated. Final inclusion or exclusion deci- 
sions were made after examination by two reviewers (G.M. 
and R.K.M.) in accordance with the most recent guid- 
ance,'^ and with strict adherence to the following criteria. 

• Population: Live-born infants who have had weight or 
other anthropometric measurements recorded at birth and 
were born at term (gestation > 37 completed weeks). 

• Index test: Any measure of weight or growth at birth, 
including: absolute birthweight (thresholds <2.5 kg, 
<2.0 kg, <1.5 kg); population or customised centUe charts 
(thresholds <10th centile, <5th centile, <3rd centile); pond- 
eral index or other growth ratios. 

• Outcome: Any measure of compromise of neonatal, 
childhood, or adult weUbeing, such as: mortality; neonatal 
morbidity, including hypoxic ischemic encephalopathy; 
childhood or adult motor disability; childhood or adult 
disease, including diabetes meUitus, cardiovascular disease, 
and hypertension. 

• Study design: Observational studies that allowed the gen- 
eration of a 2 X 2 table (true positives, false positives, false 
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negatives, and true negatives) to compute an estimate of 
the association between test result and outcomes. Studies 
with five or fewer individuals were excluded on account of 
unreliability. 

All articles were carefully examined to identify duplica- 
tions in population. Where this was identified, the most 
recent and complete versions of the work were selected. 
There was no language restriction in study selection. The 
reference lists of selected studies and review articles were 
scrutinised and additional relevant articles were obtained. 
Information was extracted from the selected articles in 
duplicate (G.L.M. and R.K.M.) using a data collection 
sheet. Data were extracted on study characteristics (includ- 
ing the threshold values used), quality, and results, and 
were entered onto an excel spreadsheet. Data were used to 
construct 2x2 tables of the association between the mea- 
sure of growth at birth using the threshold reported in the 
article and the postnatal outcome for each individual. If 
results for multiple thresholds were reported, we sought to 
construct a separate 2x2 table for each threshold. In 
studies where data were felt to be relevant but 2x2 tables 
could not be constructed, or the outcome or population 
reported in the article did not meet the specific inclusion 
criteria, the authors were contacted. The study was not 
included unless the specific data could be provided. Diffi- 
culties in data extraction were resolved by seeking input 
from a third reviewer (K.S.K.). From the overall data set, 
the subset of studies reporting neonatal adverse outcomes 
was selected for inclusion in this report. 

Study quality assessment 

All articles meeting the selection criteria were assessed for 
methodological quality, defined as confidence that the 
study design, conduct, and analysis minimised any bias in 
the estimation of an association. We assessed quality using 
the complete STARD and QUADAS checklists. These are 
validated for the reporting and methodological quality of 
diagnostic test accuracy studies, and we selected the quality 
elements that were felt to be most relevant for this review 
on prognostic tests and associations.^*"'^ We did not assign 
a quality score, as this been shown to give flawed results.'* 
We considered cohort study design to be superior to case- 
control design. A study was rated high quality if it had at 
least four of the following items: an adequate description 
of the population; an adequate description of the test (defi- 
nition of low birthweight) and the outcome measure; con- 
secutive recruitment; prospective recruitment; >90% 
completions of foUow-up; appropriate outcome measure- 
ment; blinding of the investigators performing the outcome 
measure, and a statement regarding the use of intervention 
between the index test and outcome. A study was deemed 
to be of medium quality when three criteria were met and 
low quality if two or less were adhered to. 



Data synthesis for prognostic association 

The 2x2 tables were used to compute odds ratios (ORs) 
and 95% confidence intervals (95% CIs) for each index 
test-outcome pair, and the results were pooled for each 
index test (considering each definition and threshold of 
growth as a separate test) using meta-analysis. The OR was 
selected as the summary statistic, as it represents the effect 
of the exposure on the odds in an unbiased fashion and 
enables the results of both case-control and cohort studies 
to be included.'^ It is frequently used to demonstrate an 
epidemiologic association,'^ and here it provides a measure 
of a test's prognostic ability. 

With clinical and statistical heterogeneity expected 
between studies, a random-effects model was used 
throughout to account for this, which synthesises the log- 
arithmic odds ratio estimates for each test and weights 
each study by the inverse of the variance within the study 
plus between-study variance. This method provides a 
summary estimate of the average prognostic effect of a 
test. As the prognostic ability of a test may vary from this 
average from setting to setting, after each random-effects 
meta-analysis, if f > 0% we also estimated a prediction 
interval (EPI). This reveals the potential prognostic associ- 
ation if the test is applied in a single setting similar to 
one of the studies from our analysis."* EPI was calculated 
where three or more studies were included in the 
meta-analysis. 

We plotted summary OR data in forest plots and 
assessed the between-study heterogeneity in the prognostic 
association for each test by estimating f (the level of vari- 
ability in prognostic effects arising from between-study 
heterogeneity)" and (the among-study variance of the 
true prognostic effect).^" Where the number of studies 
reporting a given birthweight standard and outcome 
allowed, we performed subgroup analysis to examine the 
effect of potential confounding factors. Singleton or multi- 
ple birth status, ethnicity, exclusion of congenital anoma- 
lies, birth of the study population during or after 1990 
(because of recent advances in antenatal and neonatal 
care), and study quality were considered to be important 
factors that may influence the strength of the association 
between low birthweight and adverse outcome. 

In each study, when a table contained cells with a value 
of 0, 0.5 was added to all cells to allow the calculation of 
log ORs and their variances for meta-analysis.^' 
Meta-analyses were performed where two or more studies 
reported the same index test and outcome measure. The 
primary outcomes were considered to be neonatal mortality 
and a composite measure of neonatal neurological morbid- 
ity and non-neurological morbidity. A composite outcome 
measure for morbidity was employed to maximise the 
number of events that could be included in the analysis 
and avoid the need to select a single morbidity as a pri- 
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mary outcome measure; however, a hazard of composite 
outcome measures is the assumption that the significance 
of the result applies to all components.^^ To address this 
issue, we analysed the component outcomes as subgroups. 
When the composite outcome measure was used, care was 
taken to ensure that each individual was only counted once 
in each analysis, particularly where studies reported multi- 
ple outcomes for a single population. Where multiple out- 
comes were reported, attempts were made to select the 
outcome that was most consistent with the other studies: 
for example, in the neonatal non-neurological morbidity 
analysis, hypoglycaemia was the most commonly reported 
outcome and therefore this was selected primarily, followed 
by other conditions. To explore for the presence of funnel 
plot asymmetry (small study effects), and thus potential 
publication bias, the Peters test was performed in each 
meta-analysis containing at least ten studies. 

For the purposes of our meta-analyses, we used data 
where birthweight had been dichotomised around a thresh- 
old specified in the primary studies. In order to compare 
the effect of birthweight when it was analysed as a continu- 
ous variable, we examined all of the included studies where 
logistic regression analysis had been performed with birth- 
weight included as a continuous variable, and qualitatively 
summarised the findings. 

Data synthesis for predictive ability 

Where there was a strong and statistically significant prog- 
nostic association between a test and an outcome measure 
(defined by an OR > 5, with a 95% CI > 1), we went on 
to calculate sensitivity, specificity, and likelihood ratios, 
again using data from the 2x2 tables and synthesising 
predictive measures using a bivariate random-effects 
meta-analysis model. This allowed us to examine the pre- 
dictive ability of the test^'': that is, whether the test can 
accurately discriminate between those who do and those 
who do not have a poor outcome (as measured by sensitiv- 
ity and specificity), and how much a positive or negative 
test result modifies the odds of a poor outcome (as mea- 
sured by the positive and negative likelihood ratios). 

All analyses were performed in Stata 10.0 (StataCorp, 
College Station, TX, USA) using the metan, metandi, and 
metabias commands.^^"^^ Plots were generated using Stats- 
Direct. 

Results 

As shown in Figure 1, after an initial search of 36 956 cita- 
tions, we included 92 primary articles in the overall system- 
atic review, of which 29 contained data relating birthweight 
standards to neonatal outcomes. Five of these were 
included after contact with authors who provided data or 
information.^'*'^^"'''' In total, data were available for 



21 034 114 neonates. Details of the studies included are 
given in Table SI; a list of excluded studies is available 
from the authors upon request. A total of 145 further arti- 
cles were felt to contain potentially relevant data, but the 
authors could not be contacted, could not supply data to 
create 2x2 tables, or upon clarification regarding the 
population the study was excluded. If a study included 
infants of <37 weeks of gestation, it was only included if 
separate data regarding term infants was given or the 
authors provided this. A number of studies contained 
duplicate populations: where there was duplication of the 
test and outcome measure the least complete study was 
excluded from the review. If the population was the same 
but the measure of growth restriction or adverse outcome 
differed, then both studies were included, but care was 
taken not to include multiple studies reporting from the 
same population within a single meta-analysis, or within 
the overall count of the number of individuals included in 

1 • 7 42 

the review. ' 

The majority of studies used population growth chart 
below the tenth percentile (n = 17) or birthweight under 
2.5 kg (n = 9) as the index test that defined fetal growth 
restriction. A wide variety of neonatal outcome measures, 
including mortality and morbidity (e.g. seizures, hypother- 
mia, hypoglycaemia, respiratory distress), were reported. 
For comparison, we grouped outcomes according to mor- 
tality, neurological morbidity, and non-neurological mor- 
bidity. 

Prognostic association with neonatal mortality 

A forest plot of the summary meta-analysis odds ratios and 
95% confidence intervals for each measure of fetal growth 
restriction in relation to neonatal mortality is given in 
Figure 2. A birthweight below 1.5 kg showed the strongest 
association with neonatal mortality (OR 48.6, 95% CI 
28.62-82.53), with no between-study heterogeneity in this 
effect. Raising the birthweight threshold to 2.0, 2.5, or 
2.9 kg gradually reduced the association and increased the 
heterogeneity, but the summary effect estimate remained 
highly significant at each threshold. Population centile 
charts were also strongly associated with neonatal mortal- 
ity, but generally showed a weaker association at all thresh- 
olds than absolute birthweight, because the summary ORs 
were closer to 1 (Figure 2). 

Prognostic association with neonatal morbidity 

The association between measures of fetal growth restric- 
tion and neonatal morbidity are given in Figure 3. The 
analysis was subdivided into reported neurological morbid- 
ity (including seizures, hypoxic ischaemic encephalopathy, 
intraventricular haemorrhage) and non-neurological mor- 
bidity (including hypoglycaemia, respiratory distress syn- 
drome, cardiac failure), according to the definitions given 
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Total studies identified from electronic searches 
(database inception to January 2011) n = 36 956 



> Citations excluded n = 35 537 



Potentially relevant articles obtained in full manuscripts 
n = 1606 

Identified from electronic searches n = 1419 
Identified from manual reference list checking n = 187 



Articles excluded from review with reasons: 



Not a measure of low birthweight 


n 


= 166 


Inappropriate outcome measure 


n 


= 37 


Lack of original data (review articles/letters) 


n 


= 384 


Gestational age unreported 


n 


= 229 


Duplicate publications 


n 


= 15 


Incorrect population 


n 


= 106 


Data not extractable 


n 


= 564 


Papers unavailable 


n 


= 13 


Total excluded 


n 


= 1514 



Selected for inclusion in overall review 
Neonatal outcomes (this report) 
Test 


n = 92 (23 051 541 individuals) 
n = 29 (21 034 114 neonates) 
No. of studies No. of neonates 


Birthweight <2.5 kg 


9 


1 459 753 


Birthweight other thresholds 


8 


1 138 755 


Population growth chart <10* percentile 


17 


19 494 993 


Population growth chart other percentiles 


4 


113 707 


Customised growth chart <10'^ percentile 


1 


12 705 


Ponderal index 


3 


22 028 


Other measurements of fetal growth 


4 


80 118 



Figure 1. Study selection process for systematic review of the prognostic and predictive ability of current birthweight standards for short- and 
long-term outcomes. 



in the primary studies. A birthweight below 2.0 kg was 
most strongly associated with neurological morbidity 
(OR 17.34, 95% CI 5.63-53.70); however, this was based 
on a single study of 770 neonates. There was a significant 
association between weight below the third, fifth, and tenth 
centiles and neurological morbidity. A birthweight below 
the tenth centile according to a customised growth chart 
and a ponderal index of < 2.25 did not show a significant 
association with this outcome. For non-neurological mor- 
bidity, birthweights below the third, fifth, or tenth centiles 
on population chart and birthweights more than 2SD 
below the population mean showed significant association 
with this outcome, with summary odds ratios of a similar 
magnitude. Subgroup analysis for individual morbidities 



was only possible for birthweight below the tenth centile 
on the population chart and neonatal hypoglycaemia (any 
threshold, three studies, OR 3.72, 95% CI 0.85- 
16.19),''^^'''^ and seizures (two studies, OR 2.35, 95% CI 
1.58-3.49).*''*^ 

Quality assessment 

The results for the quality assessment are presented in 
Table 1. The majority of the studies included were of 
cohort design (97%), and most were retrospective studies 
(73%). Most studies were of high or moderate quality 
according to our pre-specified criteria. Studies often failed 
to adequately describe the test or outcome in a way that 
would make them reproducible, and very few studies 
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Definition 

Birth weight <i5/(g«9 ".99.iii5 4 

Birth weight <2. Olf^^-^"- ™ 3 



Birth weight <2.5/(g«53''" '"''»ara5, 

106,118, 



Birth weight <2.9t<ig^' 1 
Population chart < 3rd centile'"" 1 

Population chart < 5th centile^"" 1 



Population chart <10th centile'^' 1 3 

5e,65,67-69,74,82,1 00,1 10,11 5 



No. of No. of 
Studies Individuals 



Ponderai index (weight 
(g)/100x length(cm)=>) <2.25"^ 

Fetal growth ratio' <0. 75" 



1 

1 

Fetal growth ratio' <0.80" 1 
Birth weight < mean -1SD" 2 

Birth weight < mean -2SD" 1 



a = observed weight/ 

population mean for 

gestational age 

b = estimated prediction 

interval 



1,441,363 
539,124 
1,458,983 

9201 
82,361 

82,361 
19,444,776 

21,508 
5,305 
5,305 

70,398 

15,596 



Year of birth 

1961-2005 

1958-1998 
1957-2005 

1981-1983 
1988-1996 

1988- 1996 
1981-2006 

1981-1984 
1980-1986 
1980-1986 

1989- 1993 

Unreported 

I 



~r 




~r 



~\ — r~ 

5 10 



~r 



0.2 0.5 1 2 5 10 100 

odds ratio (95% confidence interval) 



Odds ratio (95% CI) 

48.6(28.62, 82.53) 
P = 0, Tau^ = 0 

19.12(4.54, 80.59) 

F = 88, Tau^ = 1.35, EPf too large 

8.46(6.25 11.46) 

P = 77, Tau' = 0.12, EPf 3.34, 21.45 
2.64 (1.45 4.82) 

8.01 (3 78 17.0) 



623(3.08 12.61) 
4.11 (3.70, 4.56) 

P=18, Tau' = 0.005 EPI" 3.37, 5.02 
3.57(2.21, 5.79) 

7.40(0.38, 144.11) 

3358(303, 371.43) 

586(4.19, 8.22) 

P = 55, Tau' = 0.033 

11.98(7.85, 18.27) 



1 

1000 



Figure 2. Forest plot of odds ratios for the association between birtlnweiglit standards and neonatal mortality. 



described any interventions that were performed between 
the time of the birthweight measurement and the outcome 
test. Where possible a subgroup analysis using only 
high-quality studies was performed, and the results are pre- 
sented in Table 2. 

Subgroup analyses of prognostic association 

The results for subgroup analyses to address potential con- 
founding factors of the association between birthweight 
and adverse neonatal outcome, within the meta-analysis 
groups for each birthweight standard, are presented in 
Table 2. No subgroup analyses were possible for neonatal 
morbidity according to these criteria. Too few studies 
reported ethnicity in enough detail to permit subgroup 
analysis. Limiting to a singleton population slightly weak- 
ened the association between birthweight below 1.5 kg and 
neonatal death, but did not affect the association between 
birthweight below 2.5 kg and the same outcome. 

Birthweight as a continuous variable 

None of the included studies that considered neonatal out- 
comes examined birthweight as a continuous variable via 
logistic regression analysis, so it is not possible to comment 
on this further. 



Direct comparison of prognostic association for 
absolute versus population centiles 

Only one study directly compared absolute birthweight and 
centile on population chart in the same population. For 
neonatal mortality, a birthweight below 2.9 kg had an odds 
ratio of 2.64 (95% CI 1.45-4.82) and a birthweight below 
the tenth centile on the population chart had an odds ratio 
of 5.31 (95% CI 2.85-9.89) for the same outcome.^** 

Publication bias for prognostic association results 

To examine funnel-plot asymmetry (small study effects), 
and thus the potential for publication bias, the Peters test 
was applied to the only meta-analysis containing ten or 
more studies (birthweight below tenth centile and neonatal 
mortality). There was no significant evidence of small study 
effects in this group (_P = 0.996). 

Predictive ability of standards of low birth weight 
to predict neonatal death 

The outcome that had the strongest prognostic association 
overall with low birth weight was neonatal death. For 
birthweight tests with a large (OR > 5) and statistically sig- 
nificant prognostic association with this outcome, their 
predictive ability for individual babies was summarised by 
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Definition 


No. of No. of 

Studies Individuals Year of birth 


Neurological morbidity 








Birth weight <2mg™ 


1 


770 


Unreported 


Birth weight <2.5t<g'^'> 


1 


770 


Unreported 


Population chart <3rd centile"'''™ 


2 


112,695 


1988-2001 


Population chart <5th centile^"" 


2 


83,105 


1988-1997 


Population chart <10th centile"^^'-"-™ 4 


217,122 


1988-2006 


Customised chart <10th centile"^ 


1 


12,705 


2001-2005 


Ponderai Index 

(weight(g)/100x length cm') <2.25<" 
Non-neurological morbidity 


1 


20 


Unreported 


Birth weight <2.0i<g™ 


1 


770 


Unreported 


Birth weight <2.5i<g™ 


1 


770 


Unreported 


Population chart <3rd centlle^" 


1 


30,334 


2000-2001 


Population chart <5th centiie"-^"" 


2 


1012 


1988-1997 


Population chart <10th centlle^'^^-'"'^ 


8,80, g 


44,751 


1984-2005 


Customised chart <1 0th centile" 


1 


12,705 


2001-2005 


Ponderai Index (1 00 x weight (g) 
/iength cm') <10th centile'' 
Birth weight < mean-2SD"'' 


1 
1 


500 
4,415 


Unreported 
1978 



a = estimated prediction 
interval 



Odds ratio (95% CI) 

17.34(5.63, 53.70) 

4.28 (1.23, 14.92) 

3.54 (2.28, 5.52) 
P 0, Tau' 0 

2.44 (1.50, 3.98) 
P 0, Tau^O 
212(1.56, 2.91) 
P 0, Tau^ 0 

1.87(0.99, 3.55) 
14.54(0.67, 316.7) 

1.66(0.76. 3.65) 
2.03(0.71, 5.82) 

2.56 (1.65,3.97) 

5.08(1.70, 15.2) 

P 64, Tau^ 0.40 

298 (1.58, 5.61) 

P 58, Tau^ 0.32, EPF 0.49, 18.04 

2.49(0.79, 7.82) 

3.57(1.34, 9.50) 

5.34 (2.96, 9.62) 
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Figure 3. Forest plot of odds ratios for tine association of birthweight standards with neonatal morbidity. 



using meta-analysis to calculate summary sensitivity, speci- 
ficity, and Itkelihood ratios (Table 3). These measures 
reveal the discriminative ability of each test and how test 
results modify a baby's odds of having a neonatal death. 
For each test the specificities and positive likelihood ratios 
were high, but the sensitivity and negative likelihood ratios 
were generally poor (Table 3). This can be explained by the 
fact that although a higher proportion of deaths occurred 
within the low birthweight group, because this group 
represents a small fraction of the overall population, a large 
absolute number of deaths stiU occurred within the normal 
weight groups, and therefore sensitivity is low and the 'false 
negative' numbers are high, giving a poor negative likeli- 
hood ratio (close to 1). For example, the highest positive 
likelihood ratio was for birthweights below 1.5 kg, indicat- 
ing that any baby under this weight multiplied their pre-t- 
est odds of neonatal death by 49.1 (95% CI 27.3-88.5); 
however, the negative likelihood ratio was only 1.01 (1.00- 
1.01), indicating that the odds of death barely change after 
a negative test result. Thus, although a birthweight below 
1.5 kg substantially increases the odds of a poor outcome, 
a birthweight above 1.5 kg does not increase the odds of a 
good outcome. 



Discussion 

Main findings 

Low birthweight showed a strong, consistent association with 
neonatal mortality. The relationship was highest at lower 
thresholds and gradually decreased (but remained strong) as 
the threshold increased. The absolute birthweight seemed to 
be more strongly related to this outcome than centiles on 
population weight charts, especially for thresholds of 1.5 and 
2.0 kg. Restricting the analysis to singletons, year of birth 
since 1990, or by country of origin did not change the mag- 
nitude of the association. Other definitions of fetal grovrth 
restriction were based on single studies and showed mixed 
results, but none appeared to be more strongly associated 
with neonatal mortality than the absolute birthweight. The 
results for neonatal morbidity were mixed, but no single def- 
inition of growth restriction appeared to be consistently 
more strongly associated with adverse outcomes than others. 
All of the birthweight and population chart thresholds 
assessed for predictive ability showed a high specificity and 
positive likelihood ratio for neonatal death, and thus babies 
who test positive are at a substantially higher risk of neonatal 
mortality. However, each test generally had a low sensitivity 



© 2014 The Authors BJOG An International Journal of Obstetrics and Gynaecology published by John Wiley & Sons Ltd on behalf of 
Royal College of Obstetricians and Gynaecologists 



521 



Malin et al. 



and negative likelihood ratio close to 1, and thus a negative 
test result does not improve the odds that a baby will not 
have a neonatal death. 

Strengths and limitations 

This review provides the best available evidence, at the time 
of writing, regarding the association between different mea- 

Table 1. Methodological quality of studies included in systematic 
review of birthweight standards for neonatal outcomes 



Quality item Number (%) of studies n = 29 





Yes 


No 


Unclear 


Cohort study design 


28 (97) 


0 


1 (3) 


Population adequately 


28 (97) 


0 


1 (3) 


described 








Consecutive recruitment 


22 (75) 


1 (3) 


5(21) 


Prospective recruitment 


6(21) 


21 (73) 


2(6) 


Appropriate outcome 


29 (100) 


0 


0 


measure 








Outcome measure blinded 


0 


0 


29 (100) 


>90% of individuals had 


26 (90) 


0 


3 (10) 


outcome measure 








Index test and outcome 


14 (48.5) 


1 (3) 


14 (48.5) 


measure described 








Intervention between index 


1 (3) 


0 


28 (97) 


test and outcome 








Quality classification 








High 


24 (83) 






Medium 


4 (14) 






Low 


1 (3) 







sures of fetal growth restriction and adverse outcomes. No 
other review has attempted to compare different definitions 
of growth restriction to inform clinical practice. The 
strength of our review and the validity of our inferences lie 
in the methodology used. We have complied with existing 
guidelines for the reporting of systematic reviews of diag- 
nostic and observational studies. '^'^^ We have used the 
most up to date techniques for performing and interpreting 
meta-analysis. ^''"^^ An extensive literature search was per- 
formed in relevant databases with no language restrictions 
applied. Every effort was made to obtain the most complete 
data set possible through contact with authors and experts 
in the field. Peters test showed that there was no evidence 
of small study bias within our largest meta-analysis; other 
groups were too small to assess. We also considered both 
the prognostic association of birthweight tests with out- 
come (as summarised by an odds ratio) and their predic- 
tive ability (as summarised by sensitivity, specificity, and 
likelihood ratio). 

There are several limitations to our review. Different 
numbers of studies contributed to each analysis, and there 
were few direct comparisons. Indeed, in the only study that 
compared absolute birthweight and centUe chart in the 
same population, the association for birthweight below the 
tenth centile was observed to be stronger than the associa- 
tion with absolute birthweight below 2.9 kg, for this out- 
come. There was a lack of data in some analyses, e.g. 
customised centile charts and ponderal index in relation to 
adverse outcome, but as every effort was made to acquire 
both published and unpublished data we do not feel that 
anything further could be done to address this. Although 



Table 2. Subgroup analysis according to birthweight standard and neonatal mortality, where possible, for study quality, year of birth of study 
population, location of study, and singleton population 

Birth weight standard Number of studies Subgroup OR (95% CI) Estimated /^ 

prediction 
interval (EPI) 



Neonatal death 
















Birthweight <1 .5 kg 


231,46,48 


High-quality studies 


53.29 (30.08-94.39) 






/2 = 0, = 


0 


Birthweight <1 .5 kg 


231,32 


Singletons 


41.85 (16.53-105.94) 






F = 0, = 


0 


Birthweight <2.5 kg 


^31,32,43,52 


Singletons 


8.39 (4.90-14.35) 


0.86- 


-81.35 


F = 8], 


= 0.20 


Birthweight <2.5 kg 


^31,43,45,46,48 


High-quality studies 


8.15 (5.76-11.54) 


2.40- 


-27.65 


F = 80, 


= 0.12 


Birthweight <2.5 kg 


246,48 


Year of birth > 1990 


9.74 (5.31-17.85) 






= 91, 


= 0.17 


Population chart 


g7,34,36,38,40,47 


Singletons 


4.03 (3.88-4.18) 






F = 0, = 


0 


<10th centile 
















Population chart 


g7, 33, 34,36,40,50,51 


Year of birth > 1990 


4.23 (3.73^.81) 


3.23- 


-5.55 


F = 3^.T^ 


= 0.01 


<10th centile 
















Population chart 


47,36,47,50 


Congenital anomalies excluded 


4.01 (3.86-4.16) 






F = 0, = 


0 


<10th centile 
















Population chart 


g7, 34,36,47,50,51 


Studies in USA/Europe 


4.04 (3.89^.19) 






F = 0, = 


0 
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Table 3. Results for the predictive ability (sensitivity, specificity, and likelilnood ratios) of different birthweight standards for neonatal mortality 



Birthweight standard 


Sensitivity (95% CI) 


Specificity (95% CI) 


Positive likelihood 
ratio (95% CI) 


Negative likelihood 
ratio (95% CI) 


Birthweight <1. 5 kg^^-^^'*'^'*'' 
Birthweight <2.0 kg^^-"^'" 

Birthweight <2.B kg^l ■32.43,45,46,48,49,52 

Population chart <3rd centile*^ 
Fetal growth ratio <0.80** 
Birthweight < mean - 2 SD^'^ 


0.008 (0.004-0.146) 
0.05 (0.03-0.07) 
0.31 (0.19-0.47) 
0.24 (0.12-0.41) 
0.67 (0.09-0.99) 
0.13 (0.09-0.19) 


0.99 (0.99-1.00) 
0.99 (0.99-1.00) 
0.94 (0.88-0.97) 
0.96 (0.96-0.96) 
0.94 (0.93-0.95) 
0.99 (0.99-0.99) 


49.1 (27.3-88.5) 
13.3 (2.27-78.28) 
5.27 (3.57-7.76) 
6.31 (3.57-11.14) 
11.9 (3.87-32.52) 
10.53 (7.25-15.28) 


1.01 (1.00-1.01) 
0.94 (0.85-1.02) 
1.37 (1.15-1.62) 
0.79 (0.66-0.94) 
0.36 (0.07-1.75) 
0.88 (0.83-0.92) 



every effort was made to control for potential confounding 
factors through subgroup analysis, because of the quality 
and reporting of the primary studies this was not always 
possible. We strictly limited our review to infants born at 
37 weeks of gestation or later to avoid the confounding 
effect of preterm birth; however, the method of estimating 
gestation in the primary studies was often inaccurate. Very 
few studies used ultrasound measurement of crown— rump 
length at 10-13 weeks of gestation, which is the most accu- 
rate method^'': the majority used the mother's last men- 
strual period and some used a clinical examination of the 
newborn, which are less reliable and may have resulted in 
preterm infants being included inadvertently. We also rec- 
ognise that within the group of 'term' infants there is a 
continuing spectrum of gestational age and birthweight, 
and the risks are not equal, i.e. a baby at 37 weeks of gesta- 
tion will have a higher risk of adverse outcome than a baby 
at 40 weeks of gestation, irrespective of birthweight. How- 
ever, as the majority of studies did not report outcomes 
according to gestation and birthweight, we could not 
examine this issue further with the current data. Current 
clinical practice tends to group infants of 37 weeks of ges- 
tation and over together in the way that they are managed, 
so we feel that the approach in this review remains valid. 

As a result of poor reporting in the primary studies, our 
ability to perform subgroup analysis according to ethnicity 
was limited. It is known that Afro-Caribbean and Asian 
populations have smaller babies, and therefore it is likely 
that the same thresholds would not give the same results in 
all ethnic backgrounds.^" We did not analyse according to 
social class: again this was not possible with the informa- 
tion available. We limited the population to singletons 
where possible, and found that this did not significantly 
affect the results. We also recognised that the year of birth 
may be an important factor in neonatal outcome, particu- 
larly mortality, because of advances in neonatal care, and 
therefore performed an analysis limited to studies where 
the population was born in or after 1990. This did not sig- 
nificantly alter the odds ratios for either birthweight below 
the tenth centile on the population chart or birthweight 



below 2.5 kg, the only groups for which this analysis was 
possible. Customised charts may perform best in sub- 
groups, such as women who are obese, and this type of 
analysis was not possible.*' 

Comparing different standards of birthweight through 
analyses using different populations may not give a true 
result; however, no studies reported more than two stan- 
dards in the same population, and only one study com- 
pared absolute birthweight and population centile charts, 
thereby limiting our ability to deal with this issue. 

We attempted to consider all clinically important out- 
comes within this review; however, one important adverse 
outcome of fetal growth restriction that has been omitted 
is stillbirth. This exclusion was made because the remit of 
the project was to look at parameters of weight at birth 
and subsequent adverse outcome, rather than tests per- 
formed in the antenatal period. We also felt that there was 
too much potential for confounding to examine the associ- 
ation between birthweight and stillbirth, given that stillbirth 
may occur days or weeks prior to delivery, and therefore 
lead to the inclusion of premature infants in the analysis. 

Interpretation 

There is a vast literature exploring the relationship between 
fetal growth restriction and adverse outcomes, using differ- 
ent methodologies to do so. The aim of our review was to 
consider the association and prediction of different thresh- 
olds of birthweight or centile charts, and we therefore 
excluded studies where 2x2 tables could not be obtained 
from the original article or authors could not provide this. 
We therefore could not make a complete assessment of the 
association of birthweight as a continuous variable with 
adverse health outcomes. In order to address this we con- 
sidered whether the studies included in the review had 
examined the association between a continuous birthweight 
measure and adverse outcomes via logistic regression analy- 
sis; however, no studies relating to neonatal outcomes had 
performed this analysis. We did not identify any other 
systematic reviews attempting to compare different stan- 
dards of low birthweight with neonatal outcomes. 
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Our meta-analysis confirms that birthweight has a strong 
prognostic association with neonatal mortality, with low 
birthweight substantially increasing the risk of a poor out- 
come. However, although specificity and positive likelihood 
ratios were excellent, sensitivity was usually <0.5 and nega- 
tive likelihood ratios were close to 1. This means that, 
compared with the pre-test risk of neonatal death (preva- 
lence), babies with a low birthweight (test positive) are at a 
substantially increased risk, but the risk for those with a 
normal birthweight (test negative) does not change. 

Recommendations 

Future research is necessary to establish whether there is a 
birthweight standard that can accurately predict adverse 
neonatal outcomes. Initially, it is important to compare the 
different standards across the same population to enable an 
unbiased comparison, and to further explore the standards 
that were less frequently reported and therefore could not 
be included in the meta-analysis within our review, such as 
ponderal index and customised centile charts. This could 
be performed through an individual patient data (IPD) 
meta-analysis, where multiple definitions of fetal growth 
restriction could be compared across the same population, 
and factors such as ethnicity more adequately assessed.*^ 
Important factors to consider in any future IPD analysis 
are that of the accurate estimation of gestational age (i.e. 
pregnancies dated by first-trimester ultrasound scan only) 
and comparing outcome by week of gestational age rather 
than grouping all term infants together. Another option 
would be to perform further analysis on the large Scandi- 
navian birth registries, which record a variety of birth 
anthropometry that can be linked to health outcomes.*^ 

Finally, it is likely that more accurate risk predictions 
could be made using birthweight as a continuous variable, 
rather than dichotomising it using a threshold, as is cur- 
rently the general practice.** The use of measures of func- 
tional growth rather than weight alone, such as body 
composition or metabolic parameters, may help to differ- 
entiate between infants who are small because of growth 
restriction, and therefore might be at higher risk of adverse 
outcome, and those who are constitutionally small.''^ 

Conclusion 

Birthweight tests are strongly associated with neonatal mor- 
tality and morbidity, especially at lower absolute birth- 
weight thresholds, and babies that test positive (i.e. 
abnormal growth) are at a substantially increased risk of 
neonatal mortality; however, babies who test negative (i.e. 
normal growth) do not have a decreased risk of neonatal 
mortality. Further research is required to identify the 
optimum definition of low birthweight that helps best pre- 
dict the risk of adverse outcomes, and this may require 



using birthweight as a continuous variable, developing 
prognostic models that also contain other factors, and 
using individual patient data meta-analysis. 
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